Creators/Authors contains: "Nelson, Tim"


  1. Context: Linear Temporal Logic (LTL) has been used widely in verification. Its importance and popularity have only grown with the revival of temporal logic synthesis, and with new uses of LTL in robotics and planning activities. All these uses demand that the user have a clear understanding of what an LTL specification means.
     Inquiry: Despite the growing use of LTL, no studies have investigated the misconceptions users actually have in understanding LTL formulas. This paper addresses the gap with a first study of LTL misconceptions.
     Approach: We study researchers’ and learners’ understanding of LTL in four rounds (three written surveys, one talk-aloud) spread across a two-year timeframe. Concretely, we decompose “understanding LTL” into three questions. A person reading a spec needs to understand what it is saying, so we study the mapping from LTL to English. A person writing a spec needs to go in the other direction, so we study English to LTL. However, misconceptions could arise from two sources: a misunderstanding of LTL’s syntax or of its underlying semantics. Therefore, we also study the relationship between formulas and specific traces.
     Knowledge: We find several misconceptions that have consequences for learners, tool builders, and designers of new property languages. These findings are already resulting in changes to the Alloy modeling language. We also find that the English-to-LTL direction was the most common source of errors; unfortunately, this is the critical “authoring” direction in which a subtle mistake can lead to a faulty system. We contribute study instruments that are useful for training learners (whether academic or industrial) who are getting acquainted with LTL, and we provide a code book to assist in the analysis of responses to similar-style questions.
     Grounding: Our findings are grounded in the responses to our survey rounds. Round 1 used Quizius to identify misconceptions among learners in a way that reduces the threat of expert blind spots. Rounds 2 and 3 confirm that both additional learners and researchers (who work in formal methods, robotics, and related fields) make similar errors. Round 4 adds deep support for our misconceptions via talk-aloud surveys.
     Importance: This work provides useful answers to two critical but unexplored questions: in what ways is LTL tricky, and what can be done about it? Our survey instruments can serve as a starting point for other studies.
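     To make the formula-versus-trace question concrete, here is a small illustrative sketch (ours, not an instrument from the study): a finite-trace evaluator for two LTL-style formulas that read similarly in English yet disagree on the same trace. Real LTL is interpreted over infinite traces; the finite cutoff here is purely for intuition.

     # Illustrative only: a tiny evaluator for LTL-style formulas over a
     # *finite* trace (a list of sets of atomic propositions).
     def holds(formula, trace, i=0):
         op = formula[0]
         if op == "atom":
             return formula[1] in trace[i]
         if op == "and":
             return holds(formula[1], trace, i) and holds(formula[2], trace, i)
         if op == "F":  # "eventually": holds at some position j >= i
             return any(holds(formula[1], trace, j) for j in range(i, len(trace)))
         if op == "G":  # "always": holds at every position j >= i
             return all(holds(formula[1], trace, j) for j in range(i, len(trace)))
         raise ValueError(op)

     # p and q each hold eventually, but never at the same instant.
     trace = [{"p"}, {"q"}, set()]
     f1 = ("and", ("F", ("atom", "p")), ("F", ("atom", "q")))  # F p && F q
     f2 = ("F", ("and", ("atom", "p"), ("atom", "q")))         # F (p && q)
     print(holds(f1, trace))  # True
     print(holds(f2, trace))  # False: the two formulas are not equivalent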
  2. Model-finders, such as SAT/SMT-solvers and Alloy, are used widely both directly and embedded in domain-specific tools. They support both conventional verification and, unlike other verification tools, property-free exploration. To do this effectively, they must produce output that helps users with these tasks. Unfortunately, the output of model-finders has seen relatively little rigorous human-factors study. Conventionally, these tools tend to show one satisfying instance at a time. Drawing inspiration from the cognitive science literature, we investigate two aspects of model-finder output: how many instances to show at once, and whether all instances must actually satisfy the input constraints. Using both controlled studies and open-ended talk-alouds, we show that there is benefit to showing negative instances in certain settings; the impact of multiple instances is less clear. Our work is a first step in a theoretically grounded approach to understanding how users engage cognitively with model-finder output, and how those tools might better support users in doing so. 
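     As a toy illustration of the output choices under study (our sketch, not the experimental apparatus): a brute-force "model finder" that can present both a satisfying instance and a negative instance that violates the input constraint.

     # Illustrative only: enumerate a tiny domain and separate instances
     # that satisfy a constraint from those that do not.
     from itertools import product

     def constraint(x, y):
         # Toy input constraint: x below y, and their sum even.
         return x < y and (x + y) % 2 == 0

     candidates = list(product(range(4), repeat=2))
     positives = [c for c in candidates if constraint(*c)]
     negatives = [c for c in candidates if not constraint(*c)]

     print("satisfying instance:", positives[0])  # (0, 2)
     print("negative instance:", negatives[0])    # (0, 0) violates x < y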
  3. Modern-day formal methods tools are more than just a core solver: they also need convenient languages, useful editors, usable visualizations, and often also scriptability. These are required to attract a community of users, to put ideas to work in practice, and to conduct evaluations of the formalisms and core technical ideas. Off-the-shelf solvers address one of these issues but not the others. How can full prototype environments be obtained quickly? We have built Forge, a system for prototyping such environments. In this paper, we present a case-study to assess the utility of Forge. Concretely, we use Forge to build a basic protocol analyzer, inspired by the Cryptographic Protocol Shape Analyzer (CPSA). We show that we can obtain editing, basic visualization, and scriptability at no extra cost beyond embedding in Forge, and a modern, domain-specific visualization for relatively little extra effort. 
  4. Gibbons, Jeremy (Ed.)
    CONTEXT: The success of QuickCheck has led to the development of property-based testing (PBT) libraries for many languages, and the practice is receiving increasing attention. However, unlike regular testing, PBT is not widespread in collegiate curricula. Furthermore, the value of PBT is not limited to software testing. The growing use of formal methods, and the growth of software synthesis, create demand for techniques to train students and developers in the art of specification writing. We posit that PBT forms a strong bridge between testing and the act of specification: it is a form of testing where the tester is actually writing abstract specifications.
    INQUIRY: Even well-informed technologists mention the difficulty of finding good motivating examples for the use of PBT. We take steps to fill this lacuna.
    APPROACH & KNOWLEDGE: We find that the use of “relational” problems—those for which an input may admit multiple valid outputs—easily motivates the use of PBT. We also notice that such problems are readily available in the computer science pantheon of problems (e.g., many graph and sorting algorithms). We have been using these for some years now to teach PBT in collegiate courses.
    GROUNDING: In this paper, we describe the problems we use and report on students’ completion of them. We believe the problems overcome some of the motivation issues described above. We also show that students can do quite well at PBT for these problems, suggesting that the topic is well within their reach. In the process, we introduce a simple method to evaluate the accuracy of their specifications, and use it to characterize their common mistakes.
    IMPORTANCE: Based on our findings, we believe that relational problems are an underutilized motivating example for PBT. We hope this paper initiates a catalog of such problems for educators (and developers) to use, and also provides a concrete (though by no means exclusive) method to analyze the quality of PBT.
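     As a concrete sketch of the relational-problem idea (the example is ours, not taken from the paper's problem set): a property-based test for topological sorting, where many output orders are valid, so the test checks a specification instead of comparing against one expected answer. It uses the Hypothesis PBT library for Python.

     from hypothesis import given, strategies as st

     def topo_sort(nodes, edges):
         """Kahn's algorithm: returns *a* valid topological order (one of many)."""
         indeg = {n: 0 for n in nodes}
         for _, v in edges:
             indeg[v] += 1
         order = []
         ready = [n for n in nodes if indeg[n] == 0]
         while ready:
             n = ready.pop()
             order.append(n)
             for u, v in edges:
                 if u == n:
                     indeg[v] -= 1
                     if indeg[v] == 0:
                         ready.append(v)
         return order

     @given(st.data())
     def test_topo_sort_meets_spec(data):
         n = data.draw(st.integers(min_value=2, max_value=6))
         nodes = list(range(n))
         # Drawing only edges u < v keeps the random graph acyclic.
         edges = data.draw(st.lists(
             st.tuples(st.integers(0, n - 1), st.integers(0, n - 1))
               .filter(lambda e: e[0] < e[1]),
             max_size=8))
         order = topo_sort(nodes, edges)
         # The specification, not a fixed answer: a permutation of the
         # nodes in which every edge points forward.
         assert sorted(order) == nodes
         pos = {v: i for i, v in enumerate(order)}
         assert all(pos[u] < pos[v] for u, v in edges)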
  5. Numerous devices, from network switches and servers to industrial control systems, can be unreliable if they are not configured properly. Even if a device’s implementation has been proven correct, it must still be configured to meet the specific functional and security requirements of its stakeholders. However, manual configuration remains labor-intensive and error-prone, even for experts. Automated configuration synthesis presents a promising way forward. Unfortunately, as we show, existing counterexample-guided algorithms can perform poorly if the system model allows configuration changes during execution. Yet disallowing such changes can hide significant problems, such as privilege escalation. We present a new synthesis algorithm that exploits structure inherent in state-machine models where the system configuration changes. We implement it using the Kodkod relational model finder, and show that it performs favorably on a number of configuration-synthesis tasks.
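     For intuition, here is a generic counterexample-guided loop of the kind the paper critiques, sketched on a toy firewall-configuration problem. This is our illustration of the algorithm family, not the paper's algorithm or its Kodkod-based implementation.

     # Toy CEGIS-style loop: guess a configuration, verify it, and learn
     # from each counterexample to prune later guesses.
     from itertools import chain, combinations

     PORTS = [22, 23, 80, 443]
     REQUIRED = {22, 443}   # flows the configuration must allow
     FORBIDDEN = {23}       # flows the configuration must block

     def candidates():
         # All subsets of PORTS, smallest first: the guesser's search space.
         return chain.from_iterable(
             combinations(PORTS, k) for k in range(len(PORTS) + 1))

     def verify(config):
         # Return a counterexample, or None if the configuration is correct.
         for p in REQUIRED:
             if p not in config:
                 return ("must_allow", p)
         for p in FORBIDDEN:
             if p in config:
                 return ("must_block", p)
         return None

     learned = []  # counterexamples accumulated so far
     for guess in candidates():
         config = set(guess)
         if any((kind == "must_allow" and p not in config) or
                (kind == "must_block" and p in config)
                for kind, p in learned):
             continue  # already ruled out by a past counterexample
         cex = verify(config)
         if cex is None:
             print("synthesized config:", sorted(config))  # [22, 443]
             break
         learned.append(cex)  # refine: remember why this guess failed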
  6. Model-finding tools like the Alloy Analyzer produce concrete examples of how a declarative specification can be satisfied. These formal tools are useful in a wide range of domains: software design, security, networking, and more. By producing concrete examples, they assist in exploring system behavior and can help find surprising faults. Specifications usually have many potential candidate solutions, but model-finders tend to leave the choice of which examples to present entirely to the underlying solver. This paper closes that gap by exploring notions of coverage for the model-finding domain, yielding a novel, rigorous metric for output quality. These ideas are realized in the tool CompoSAT, which interposes itself between Alloy’s constraint-solving and presentation stages to produce ensembles of examples that maximize coverage. We show that high-coverage ensembles like those CompoSAT produces are useful for, among other things, detecting overconstraint—a particularly insidious form of specification error. We detail the underlying theory and implementation of CompoSAT and evaluate it on numerous specifications.
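     The ensemble idea can be sketched as greedy set cover (our simplification; CompoSAT's actual coverage metric is defined over the specification and instance structure): repeatedly add the instance that covers the most not-yet-covered points.

     # Illustrative only: pick an ensemble of instances greedily by
     # marginal coverage gain.
     def pick_ensemble(instances, limit):
         covered, ensemble = set(), []
         for _ in range(limit):
             name = max(instances, key=lambda n: len(instances[n] - covered))
             gain = instances[name] - covered
             if not gain:
                 break  # nothing new left to cover
             ensemble.append(name)
             covered |= gain
         return ensemble, covered

     instances = {  # hypothetical instances and the behaviors they exhibit
         "A": {"empty-rel", "self-loop"},
         "B": {"self-loop", "two-node-cycle"},
         "C": {"two-node-cycle"},
     }
     ensemble, covered = pick_ensemble(instances, limit=2)
     print(ensemble, sorted(covered))
     # ['A', 'B'] ['empty-rel', 'self-loop', 'two-node-cycle']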
  7. Scenario-finding tools like the Alloy Analyzer are widely used in numerous concrete domains like security, network analysis, UML analysis, and so on. They can help to verify properties and, more generally, aid in exploring a system’s behavior. While scenario finders are valuable for their ability to produce concrete examples, individual scenarios only give insight into what is possible, leaving the user to make their own conclusions about what might be necessary. This paper enriches scenario finding by allowing users to ask “why?” and “why not?” questions about the examples they are given. We show how to distinguish parts of an example that cannot be consistently removed (or changed) from those that merely reflect underconstraint in the specification. In the former case we show how to determine which elements of the specification and which other components of the example together explain the presence of such facts. This paper formalizes the act of computing provenance in scenario-finding. We present Amalgam, an extension of the popular Alloy scenario-finder, which implements these foundations and provides interactive exploration of examples. We also evaluate Amalgam’s algorithmics on a variety of both textbook and real-world examples.
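     A toy rendering of the central distinction (ours, not Amalgam's actual algorithm): flip one fact of a satisfying example at a time and re-check the specification. Facts whose flip breaks the spec are necessary; facts whose flip still satisfies it merely reflect underconstraint.

     def spec(inst):
         # Toy specification over three boolean facts: a must hold, and b implies c.
         return inst["a"] and (not inst["b"] or inst["c"])

     example = {"a": True, "b": True, "c": True}  # one satisfying instance
     assert spec(example)

     for fact in example:
         flipped = dict(example, **{fact: not example[fact]})
         status = "necessary (why?)" if not spec(flipped) else "underconstraint (why not?)"
         print(fact, "->", status)
     # a -> necessary: the spec requires a
     # b -> underconstraint: b could just as well be False
     # c -> necessary: since b is True here, c must hold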
  8. Model-finders such as SAT-solvers are attractive for producing concrete models, either as sample instances or as counterexamples when properties fail. However, the generated model is arbitrary. To address this, several research efforts have proposed principled forms of output from model-finders. These include minimal and maximal models, unsat cores, and proof-based provenance of facts. While these methods enjoy elegant mathematical foundations, they have not been subjected to rigorous evaluation on users to assess their utility. This paper presents user studies of these three forms of output performed on advanced students. We find that most of the output forms fail to be effective, and in some cases even actively mislead users. To make such studies feasible to run frequently and at scale, we also show how to pose them on the crowdsourcing site Mechanical Turk.
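     As a sketch of one of the studied output forms (ours, for illustration only): shrinking an arbitrary satisfying assignment toward a locally minimal model by greedily switching atoms off while the formula stays satisfied.

     # Illustrative only: local minimization of a satisfying assignment.
     def minimize(model, satisfies):
         model = dict(model)
         for atom in list(model):
             if model[atom]:
                 model[atom] = False
                 if not satisfies(model):
                     model[atom] = True  # this atom is needed; restore it
         return model

     formula = lambda m: m["p"] or m["q"]  # toy spec: at least one of p, q
     arbitrary = {"p": True, "q": True}    # what a solver might return
     print(minimize(arbitrary, formula))   # {'p': False, 'q': True}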